VisIVOWeb: A WWW Environment for Large-Scale 
Astrophysical Visualization 
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Abstract 

This paper presents a newly developed WWW environment called VisIVOWeb aiming at pro- 
viding the astrophysical community with powerful visualization tools for large-scale datasets in 
the context of Web 2.0. VisIVOWeb can handle effectively modern numerical simulations and 
real-world observations. Our software is open-source based on established visualization toolk- 
its and offers high quality rendering algorithms. The underlying data management is discussed 
together with the supported visualization interfaces and movie making functionality. We intro- 
duce VisIVOWeb Network, a robust network of customized web portals for visual discovery and 
VisIVOWeb Connect, a lightweight and efficient solution for seamlessly connecting to existing 
astrophysical archives. A significant effort has been devoted for ensuring interoperability with 
existing tools by adhering to IVOA standards. We conclude with a summary of our work and a 
discussion on future developments. 

Subject headings: Scientific Visualization, Cosmological Simulation, Large-Scale Datasets, Visual Dis- 
covery, High-Perforniance Rendering, Web 2.0. 



1. Introduction 

The primary goal of scientific visualization is to 
create images and animations to aid scientists in 
understanding complex datasets, e.g. large-scale 
multi-dimensional datasets obtained either from 
numerical simulations or real-world astrophysical 
observations. 

The typical sizes of modern astrophysical datasets 
are increasing steadily due to continuous advances 
in computational power and instrumentation. As 
a result such datasets often require storage in a 
distrib uted way, e.g. the Millenium-II Simula- 






tion (ISpringel at al. 20051 ) which produced nearly 
20 TBs. Present and next-generation sky sur- 
veys are also of constantly increasing sizes and 
complexity, e.g. the Sloan Digital Sky Survey 
( Abazaiian et al. 20091 ) that offers a catalogue 



of 15.7 TBs. Moreover a significant effort has 
been devoted in handling highly-complex numeri- 
cal simu lations in the context of the Virtual Obse r- 
vatory (jPadovani et al. 2Qo3 lAUam et al. 200 
e.g. the Italian Th eoretical Virtual Observator;,_ 
( Costa et al. 20081 ) a nd the German Astrophysica l 
Virtual Observatory ( Bovlan-Kolchin et al. 20091 ) 
offering dark matter halo catalogues and merger 
trees from the Millennium-II. 
Modern visualization can aid astrophysicists in 
gaining good insights of highly-complex datasets, 
through rapid and intuitive discovery of famil- 
iar patterns and correlations between properties, 
without involving CPU intensive analysis codes. 
As a result, suitably-constructed visualization 
tools can be instrumental for future astrophysi- 
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cal advances, e.g. by allowing comparing simula- 
tions meaningfully or even correlating appropri- 
ately simulations and observational datasets. As 
an example consider the massive halo formation 
process, or interaction among galaxies as seen in 
the evolution of a simulation and similar observed 
objectcl. 

This article presents a WWW environment for 
visualization of large-scale astrophysical datasets 
that can be installed easily in data centers hosting 
the datasets, offering astrophysicists an array of 
advanced visualization tools exploited effectively 
using common Internet browsers. VisIVOWeb is 
our open-source visualization software; it is hosted 
by sourceforge.ncl|j and is released under GNlo li- 
cense. The latest version can be accessed through 
Subversioqj, an open-source revision control sys- 
tem. 

The related work is outlined in section 2. The 
main ingredients of VisIVOWeb are then intro- 
duced, namely handling of large-scale datasets, 
data management services and supported visu- 
alization and exploration interfaces, and finally 
VisIVOWeb Network and VisIVOWeb Connect 
for networking VisIVOWeb portals and connect- 
ing them seamlessly to astrophysical archives. The 
way users can gain access to the system and im- 
port their datasets is described in sections 3 and 
4 respectively. Section 5 describes the underly- 
ing data-model required to support VisIVOWeb 
functionality and outlines a typical operational 
scenario. The visualization algorithms supported 
together with the relevant visualization interfaces 
are presented in sections 6 and 7. We discuss the 
creation of scientific movies in section 8 focusing 
on a specific example for demonstration. We then 
outline VisIVOWeb Network and VisIVOWeb 
Connect together with our recent dissemination 
activity. Finally we conclude with a summary of 
our work including pointers to some future devel- 
opments. 

2. Related works 

VisIVOWeb implements a customized WWW 
interface wrapped around VisIVOScrvcr that 



can be used through common Internet browsers. 
We have impleme nted VisIVOScrvcr previously 
(JBecciani et al. 20 101 through a comprehensive 
collection of modules for processing and visual- 
ization of astrophysical datasets. The software 
is open-source, written in C-|— I- and hosted by 
sourceforge.neCl. The underlying visualization 
engine is founded on the functionality of th e Vi- 
sualisation Toolkil|3 (jSchroeder at al. 20061 ). As 
VisIVOWeb is wrapped around VisIVOScrvcr, it 
offers fast rendering of customized 3D views of 
large-scale astrophysical datasets in a WWW en- 
vironment that provides a unique mix of data 
management and multi-dimensional visualization 
tools. 

To the best of our knowledge, no other compara- 
ble environments exist in the Virtual Obs ervatory 
(jPadovani et al. 20041 : lAllam et al. 20051 ). Typ- 



ically the rendering process is computationally 
intensive often requiring millions of floating-point 
and inte ger operation s for generating single 3D 
views (ICrocket 1997| ). This is the reason for 
developing a fully C-l — I- implementation rather 
than optimizing other exist ing Java-bas ed envi- 
ronments, e.g. TOPCAT (JTavlor 2005h . Com- 
pared to TOPCAT our rendering capability is 
significantly enhanced as we can handle large- 
scale mutli-dimensional datasets. The visual- 
ization functionality provided by VisIVOWeb is 
also very different compared to a plotting tool 
su ch as VoPlot (iKale et al. 20041 ) and Aladin 
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(jBonnarel et al. 20001 ). The latter is simply a 
software sky atlas allowing the user to visualize 
digitized astronomical images and to also super- 
impose entries from astronomical catalogues or 
databases. 

Among the visualization algorithms provided 
by VisIV OWeb for handlin g point-like dat asets 
Splotch (JDolag et al. 20081 : Ijin et al. 201o| ) de- 
serves particular attention. A customized ray- 
tracer is employed for creating meaningful 3D 
views of modern cosmological simulations effec- 
tively. The intensive underlying calculation is 
optimized by pre-procesing the relevant parti- 
cles, and recent developments exploit CPUs for 
hardware acceleration. The rendering is achieved 
through composition of the final color in each pixel 
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by calculating emission and absorption of individ- 
ual volume elements. Splotch is released under 
GNU license and is integrated in VisIVO Server 
fully. We focus on visualization interfaces within 
VisIVO Web in section 7. The next section out- 
lines the typical user scenario for gaining access 
to our system. 

3. System Access 

VisIVOWeb is supported fully by commonly 
used Internet browsers, e.g. Internet Explorer, 
Firefox, Chrome, Safari and Opera. JavaScript, 
Java, pop-up windows and cookies must be en- 
abled. 

The system can be accessed using either of the 
following modes: authorized or anonymous. To 
employ an authorized account, users are required 
to complete a web form; their system account is 
then activated following approval, typically in a 
couple of hours. On the other hand, anonymous 
accounts do not require approval, they are simply 
created through the home page of the system and 
they are activated immediately. These accounts 
employ randomly generated usernames and empty 
passwords. Assuming there is sufficient amount of 
disk space available, there is no limit on the max 
number of anonymous accounts employed by the 
system. 

All users (no matter if anonymous or authorized 
access) are provided with their own individual 
staging area within the system for uploading and 
managing not only their datasets but more im- 
portantly the visualizations produced from these 
datasets, e.g. images and movies. For anonymous 
accounts, although access to full functionality for 
processing and visualization is provided, they are 
valid for no more than four days since last user 
access. At the end of this period all user uploaded 
datasets and created visualizations are removed 
automatically. Authorized accounts are valid for 
two months since last user access. Prior to expiry 
a warning e-mail is sent, then a simple login into 
the system by the user can extend the validity of 
an account automatically. The typical scenario for 
importing datasets into the system is outlined in 
section 4. 



4. Importing Datasets 

Before any analysis and/or visualization is per- 
formed, user datasets must be imported into 
VisIVOWeb. The system currently supports 
most commonly employed astrophysical data 
format s e.g. ASCII and CSV ta bles, VOTa- 
bles (|Ochsenbein fc Wilhams 20091) . FITS Ta- 
bles, GADGET-2 (jSpringel 20051 ). Raw Binary 
and Raw Grid. Furthermore other data formats 
arc supported either targeting specific applica- 
tions that are used by a sizable scientific com- 
munity or being of particular scientific interest. 
Consequently the system supports the outcomes 
of the code FLY (|Becciani et al. 2007t) and TVO 
XML, a VOTable that can be used to describe the 
content of generic binary files. This descriptor is 
currently under revision by the Theory Interest 
Group within the IVOA frameworlo. 
Several interfaces are provided for importing 
depending upon the data format of the in- 
put datasets. VisIVOWeb can handle point 
clouds as well as volumetric datasets; the lat- 
ter typically consist of collections of scalar val- 
ues arranged on regular grids. Furthermore 
point clouds can be transformed into volumet- 
ric d atasets for visualization us ing a cloud in 
cell (JHocknev &: Eastwood 19811 ) smoothing al- 
gorithm. 

4.1. VisIVO Binary Tables 

Datasets imported into VisIVOWeb are always 
converted into an internal data representation 
called VisIVOBinary Table (hereafter VBT). A 
VBT can describe both point datasets and vol- 
ume datasets; for detailed information on its 
structure and header the reader is referred to 



(jBecciani et al. 20101 ) . The importing of a dataset 
may produce one or more VBTs, e.g. a GADGET- 
2 snapshot can generate six VBTs corresponding 
to different particle types associated with gas, 
halo, disk, bulge, stars and boundary. 

4.2. Process 

The importing interface can be deployed by a 
local or a remote upload. The local upload is a 
synchronous operation to transfer a dataset from 
the user's PC to VisIVOWeb handled fully by 
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the user's browser. However in case of large-scale 
datasets (e.g. more than 100 MBs) the suggested 
solution is a remote upload. This will perform 
an asynchronous transfer from a remote location 
to VisIVOWeb. The system supports most com- 
monly used transfer protocols, e.g. http, sftp and 
ftp. 

To import using remote upload, users are simply 
required to provide the address of a URL location 
in which the relevant datasets are stored. A lo- 
gin and password can be provided optionally to 
grant access to the remote resources. An e-mail 
is sent to the user at the end of the process re- 
porting the details of the transfer operation. The 
overall time for importing consists of uploading, 
converting and creating the underlying database 
objects. 

• Uploading: This depends on the size of the 
data file to be processed and on the network 
bandwith. 

• Converting: This depends on the size of the 
data file and on the data type and number 
of tables that it contains. This operation is 
required for conversion of datasets into the 
internal data representation as described in 
section 4.1. 

• Creating Database Objects: This is typically 
a very short period of time that is required 
for creating all the necessary data structures 
and metadata related to importing. Section 
5 describes the details of the database ob- 
jects. 

The graph in figure [T] shows the scaling of overall 
importing times versus dataset sizes. Our tests 
were performed on a dual core AMD Opteron 
processor 280 with 4GBs RAM. Several binary 
datasets were imported varying their sizes from 
380 MBs up to 12 GBs. The linear scaling ob- 
served suggests that handling large datasets is ef- 
fectively restricted only by underlying file system 
limitations. 

5. Data Management 

The underlying data-model is implemented 
through a relational database. This allows users 
their private staging area within the system for 
managing their datasets as well as images and 



movies produced from such datasets. Typically a 
number of operations arc employed prior to any 
analysis and/or visualization, e.g. for extracting 
interesting features. This section describes the 
developed database objects (sections 5.1 and 5.2), 
the relevant interface (section 5.3) and outlines 
a representative number of available operations 
(section 5.4). 

5.1. Metadata Objects 

The metadata (or attributes) associated with 
uploaded datasets are encapsulated by customized 
VisIVOFile objects. These objects are inherited 
from VisIVOMetadata objects, that encapsulate 
attributes such as owner or creation time, while 
adding new attributes such as the file format em- 
ployed (e.g. CSV or FITS Tables). Moreover 
the objects VisIVOTable and VisIVOVolume are 
defined by inheriting from VisIVOMetadata for 
representing VBTs (see section 4.1). VisIVO- 
Table objects introduce attributes such as endian- 
ity, number of fields, type and number of elements. 
VisIVOVolume objects introduce attributes such 
as number and size of cells for each dimension (Fig- 
ure [2|) . As an example consider importing into 
our system GADGET-2 datasets. A VisIVOFile 
object will be generated by the original file and 
importing may produce six (one for each particle 
type) VisIVOTable objects. 

5.2. Operation Objects 

An operation in VisIVOWeb can be described 
as a many-to-many relation connecting VisIVO- 
Metadata objects. We can simplify this rela- 
tion by introducing a VisIVOOperation object 
to treat a many-to-many relation as two one-to- 
many relations (Figure [2]). In other words many 
VisIVOMetadata objects can possibly generate 
several VisIVOMetadata objects by using a sin- 
gle VisIVOOperation. We also define the objects 
VisIVOOpOutList / VisIVOOpInList as aggrega- 
tions of VisIVOMetadata objects. As an example 
consider merging VBTs representing point clouds. 
Then, VisIVOOpInList is the aggregation of the 
original VisIVOMetadata objects corresponding 
to the relevant VBTs while VisIVOOpOutList will 
contain a new VisIVOMetadata, i.e. the result of 
the operation. 
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Fig. 1. — Scaling of overall importing times against dataset sizes. The error bar is calculated from several 
repeated realizations. 
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Fig. 2. — The data model employed in VisIVOWeb: (a) UML class diagrams representing object hierarchy 
and (b) an entity relationship diagram (crow's foot notation). 
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Fig. 3. — The graphical interface employed in our system for data management activity. 



5.3. Operational Scenario 

We provide our system with a graphical web 
interface that allows to easily store data and work 
with them. Data are organized in a tree-like struc- 
ture (Fig. [3]). In this way a user is allowed 
to select one or more datasets and to use them 
as members of an operation or as elements of a 
visualization pipeline. Different objects in the 
data-model such as files, volumes and tables have 
different icons. This interface allows to manage 
object metadata and to select an individual object 
or a whole branch to be deleted. 



5.4. Operations 

A single point VBT is required by the following 
operations: 

• Randomization - A new point VBT is pro- 
duced by randomly selecting particles ac- 
cording to user-prescribed percentage val- 



• Math - Several mathematical operations for 
combining columns of VBTs. A suitable 
drop-down menu is provided to assist users 
in inputting any required algebraic expres- 
sion. 

• Select Rows - This is a multi-dimensional se- 
lector. The user can impose constraints to 
specify min/max values for particular fields 
of a VBT. The result of the operation is a 



new point VBT matching the prescribed cri- 
teria. 

• Append - A new VBT is created from a num- 
ber of VBTs with an identical number of 
columns. 

• Extraction - A new VBT is created by identi- 
fying a region of interest using a box/sphere. 
The user must supply suitable columns to 
be used as Cartesian coordinates, and also 
dimensions and placement of the extracting 
box/sphere. 

• Point Distribution - A point VBT is trans- 
formed into a volumetric VBT. Each point 
is distributed to nearby voxels using one 
of the following algorithms: cloud in cell, 
nearest grid point or triangu lar shape cloud 
(JHocknev fc Eastwood 198l[) . 



A point VBT and a volume VBT are required by 
the following operation: 

• Grid2Point - For distributing the values of a 
volumetric VBT within a point VBT in the 
same domain using the cloud in cell, near- 
est grid point or triangular shape cloud al- 
gorithm. 

The following operations are for volume VBTs 
only: 

• Coarse Volumes - This reduces a VBT us- 
ing a plane for selecting a region of interest 



according to uscr-prcscribcd percentage val- 
ues. 

• Sub Volumes - This operation extracts rect- 
angular regions of interest from prescribed 
VBTs. 

Currently several new operations are planned to 
be implemented; to obtain the most recent devel- 
opments the reader is referred to Sourceforge.neo 

6. Visualization 

The standard visualization views in VisIVOWeb 
employ orthographic projections. The focal point 
is fixed in the center of the box to be observed 
and users can prescribe the camera azimuth and 
elevation. The system supports several algorithms 
based on the Visualization Toolkill^°l and Splotch 
(|Dolag et al. 2008Hjin et al. 2010l ). The former is 
suitable for point clouds and volumetric datasets 
while the latter is a customized ray-tracer suit- 
able for particle-based datasets. To decouple our 
visualization engine from the hosting computer in- 
frastructure a default installation of VisIVOWeb 
employs the Mesa libraries, an open-source im- 
plementation of OpenGL. Nevertheless, a cus- 
tomized installation of the system is feasible so 
as to exploit fully the hardware capability of the 
underlying hosting server. 

6.1. Point Clouds 

Point clouds can be visualized using either 
VTK-based algorithms or Splotch, provided that 
suitable columns are selected from the input VBT 
for a Cartesian system of coordinates to be used 
for rendering. The entire computational box of an 
N-body simulation (z = 0.1) is displayed at the 
top of figure m its size is 70 Mpc/h. The image 
at the bottom illustrates fine details in a sub-box 
from the same simulation. By using predefined 
color tables, scalar values can be mapped to col- 
ors so as to modulate VisIVOWeb rendered views 
based on values of specific properties of individ- 
ual points in a cloud. Apart from colors, user 
defined shapes, e.g. cylinders, cones or spheres, 
can also be attached to particular points so that 
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geometrical characteristics, e.g. height or radius, 
can be modulated according to values of specific 
point properties. Splotch also operates by map- 
ping scalar data (a column of the input VBT) 
to colors by indexing though a set of predefined 
color palettes. A summary of mappings of VBT 
fields to visible elements in the VisIVOWeb views 
is shown in table [H for VTK-based and Splotch 
rendering respectively. 

6.2. Volumes 

Volumetric datasets can be visualized by using 
either volume rendering algorithms or isosurfac- 
ing. Figure [S] illustrates the renderind of a major 
merger galaxy cluster simulated with high spa- 
tial resolution using the ENZO l.jJij cosmologi- 
cal code. The upper image is a temperature pro- 
file visualized using volume rendering. The lower 
image is a density profile visualized using isosur- 
facing. (Box size = 187 Mpc/h; voxels = 544^ ). 
The isosurfaces in the bottom image correspond 
to prescribed constant values attained by the den- 
sity function. Inspecting the rendering results 
for a number of different constant values reveals 
the inner structure complexity of the underlying 
datasets. 

7. Visualization Interfaces 

The visualization interfaces provided within 
VisIVOWeb are controlled by a widget interac- 
tively for setting viewing parameters, e.g. zoom 
in/out factors, azimuth and elevation (Figure [6]). 
This widget is implemented as a Java applet 
controlling an interactive box within the user's 
browser. The user can interact with the widget by 
using either the computer mouse or the buttons 
built into the widget. Once the viewing parame- 
ters are fixed the user can render via the display 
button. 

7.1. Opacity Function and Color Maps 

An interactive editor is provided (Figure [7]) al- 
lowing users to create customized color maps and 
opacity functions for visualization. On the left 
hand side of the interface the opacity function 
and color map are displayed. Using the computer 
mouse a user can edit this opacity function not 
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Fig. 4. — The upper image is the entire computational box of an N-body simulation z = 0.1, box size = 
70Mpc/h, number of particles = 800'^. The lower image is a sub-box of the same simulation. The electronic 
version of the article includes color plates. 




Fig. 5. — A major merger galaxy cluster simulated with high spatial resolution using the ENZO 1.5 cosmo- 
logical code. The upper image is a temperature profile visualized using volume rendering. The lower image 
is a density profile visualized using isosurfacing. Box size = 187 Mpc/h; voxels = 544"^. The electronic version 
of the article includes color plates. 
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Fig. 6. — The interactive widget employed for adjusting rendering settings for a cubical sub-sample in an 
N-body cosmological simulation (z = 0.1). 




Fig. 7. — The widget employed for adjusting opacity functions and color maps. 
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Table 1: Modulating VisiVOWeb rendered views for point clouds. 
X Y Z Color Intensity Radius Height 



VTK-Based 

Splotch 



only by inserting additional nodes but also by up- 
dating or even deleting existing nodes, simply by 
dragging them out of the window. The overall 
opacity function is constructed by linearly inter- 
polating all nodes. The user can also adjust each 
node's color through the color selector, HSB ed- 
itor or RGB editor, the corresponding color map 
is updated in real time. Color maps and opacity 
functions can be saved for later use into system 
templates. 

7.2. Interfaces 

The graphical user interface for VTK-based 
rendering is shown in Figure [5] The user can se- 
lect VBT columns to construct a Cartesian sys- 
tem of coordinates for rendering. The scale option 
is employed for transforming coordinates so that 
rendering occurs within a cubical box if required. 
Changes in the opacity value are typically used for 
revealing and/or masking a dataset's inner struc- 
tures. Finally, using the lookup table option a 
VBT column can be selected to be mapped into 
a color palette. A set of pre-defined color tables 
are accessible by default. The user can also apply 
shapes, e.g. spheres, cylinders or cones, during 
rendering of individual points, modulating their 
geometric characteristics by employing a specific 
property. The graphical user interface for Splotch 
rendering is illustrated in Figure [9l Firstly, the 
VBT columns for constructing the Cartesian sys- 
tem of coordinates for rendering must be cho- 
sen. Subsequently several scalar values need to 
be selected by the user to be associated to inten- 
sity and color, H SML (SPH SMoothing Length) 
(jMonaghan 1988[ ). HSML factor, gas gray absorp- 
tion and brightness respectively. 

8. Scientiflc Movies 

Scientific movies are useful not only to scientists 
for presenting and communicating their research 
results, but also to museums and science cen- 
ters for introducing the general public into com- 



plex scientific concepts. The creation of a movie 
represents a significant challenge for the under- 
lying computational resources as often hundreds 
or thousands of high quality images must be pro- 
duced. For example, creating a fly-through lasting 
5 mins within a 16 million particle cosmological 
simulation requires 38200 seconds in a dual core 
AMD Opteron processor 280 with 4GBs of RAM 
memory. 

8.1. Temporal Parallelism 

To alleviate the high computational demands a 
temporal parallelism is adopted for decomposing 
the overall computation. The fundamental unit of 
computation is a complete frame and an individ- 
ual processor is assigned with a number of frames 
to be rendered. Our system is fully integrated 
with OpenPBqi^ and LSIO. A number of frames 
from scientific movies created on a cluster of 54 



CPU cores using the portal at (jINAF-Cataniaf ) 
are shown in figure [5j 

8.2. Operational Scenario 

Currently movies are created in VisiVOWeb by 
using several interfaces. Firstly, users can create 
fiy-throughs by moving a camera along a motion 
path which is prescribed within the domain of a 
dataset. 

Secondly, fiy-throughs can be produced by inter- 
mediate snapshots specified as camera positions / 
orientations; the system generates a movie with a 
camera path containing the specified positions / 
orientations. 

Thirdly, movies can be created by interpolating 
several steps of a time evolution. The user can 
browse a cosmological time evolution and choose 
two or more coherent datasets. The system will 
then produce the necessary number of intermedi- 
ate datasets by calculating particle positions and 
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Fig. 9. — The graphical user interface employed in VisIVOWeb for Splotch rendering. 
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applying boundary conditions as necessary. This 
approach can be very useful, e.g. in revealing 
galaxy formation or large-scale structures such as 
galaxy clusters. 

8.3. A Movie from a 512 Million Particle 
Simulation 

One of the most challenging tasks in performing 
large-scale cosmological simulations is monitoring 
the outcomes, so that if anomalies are observed the 
simulation run is corrected appropriately. Typ- 
ically such monitoring requires visualization of 
several snapshots at different red-shift, each being 
tens of GBs. 

We recently performed a large-scale structure 
of the Universe simulation (lOOMpc h^^) con- 
taining 800^ particles. The run was performed 
on the Cometa Con sortium Grid^^l using FLY 
(|Becciani et al. 20071) . The simulation was car- 
ried out over three months using 250 CPU cores; 
each data snapshot was 12 GBs. 
We employed VisIVOWcb firstly to extract 8 
million particles and secondly to identify a sub- 
region (15Mpc h~^) by using the operations de- 
scribed in section 5.4. We then interpolated 
the original snapshots in the time domain pro- 
ducing a large number of intermediate frames 
that have been used in creating a movie show- 
ing the fluid time evolution. Overall 1500 inter- 
mediate datascts were created requiring a CPU 
time of 2 months. The final movie employs 
a frame rate of 10 fps and can be found at 



http : //astrct . oact . inaf . it/vi sivo/cosmo/, 
The whole data archive handled for the movie 
creation was in excess of 18 TBs. 

9. VisIVOWeb Connect and Network 

VisIVOWeb Connect is a lightweight solution 
that allows seamless connection of VisIVOWeb to 
any astrophysical archive by adopting an http 
based approach. Using this technology users 
can view and manage datasets from any Internet 
archive directly within VisIVOWeb. The system 
can handle any data archive and it allows auto- 
matic uploading of datasets into a user-specified 
VisIVOWeb portal. 
To exploit the VisIVOWeb Connect functionality 



the relevant datasets must be encoded using a 
supported data format and be accessible through 
a supported data transfer protocol. Assuming 
that these conditions are observed, integrating 
VisIVOWeb Connect with any Internet archive is 
straightforward, simply requiring a URL contain- 
ing locations / datatypes of the datasets to be 
uploaded. 

Recent developments begun the deployment of a 
network of VisIVOWeb portals. At the time o f 
writing active nodes exist at ( INAF- Catania!) , 



(|INAF-Triestel ) and (jUniv. of Portsmouth! ) . The 
creation of a network of VisIVOWeb portals allows 
the user to choose the closest in order to mini- 
mize uploading times. Furthermore it allows to 
potentially customize different portals with tools 
specific to particular archives. 

10. Current Status and Future Work 

The visualisation functionality supported by 
VisIVOWcb is rich and wc arc currently in the 
process of developing several demonstrators to be 
used for formal evaluation in the context of spe- 
cific astrophysical communities using numerical 
simulations!^ but also real-world observational^. 
The initial feedback is very encouraging and we 
are working on fixing software bugs prior to our 
next major release, but also pursuing several en- 
hancements. 

Firstly, several new operations are planned to be 
implemented, to expand significantly the collec- 
tion of operations presented in section 5.4; the 
reader is referred to sourceforgc.net [^ for the 
newest developments. Secondly, we arc work- 
ing in upgrading the core visualisation func- 
tionality to the latest version of the Visualisa- 
tion Tool kit, and the rec ently released version of 
Splotch ( Jin et al. 20101) . The latter work is of 
particular importance as it offers the exciting pos- 
sibility for exploiting hybrid massively-parallel ar- 
chitectures containing large numbers of multicore 
CPUs and CUBA B enabled CPUs. To exploit 
high performance computing, we have already in- 
tegrated within our system the functionality to 
allow authenticated users to submit movie cre- 



^' ^http: //trow ■pi2s2.lt I 



^^http : //astrct . oact ■ Inaf ■ lt/vislvo/cosmo7] 

^^ ihttp : //www ■ sdss ■ org ■ uk/ 1 

^' ^http://sourcef orge.net/projects/vislvoweb/ 1 

^^ |http : //www . nvldla . com/ob j ect /cuda_home_new . html | 
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ation jobs using the EGEE grid resources, based 
on the computational capabihty provided by the 
grid operated by the COMETA consortium. 
Finahy, depending upon the user feedback ob- 
tained during the formal evaluation activity and 
within the scope of specific astrophysical commu- 
nities, we are planning to incorporate new func- 
tionality, e.g. for rendering streamlines and AMR 
datasets efficiently or designing highly customized 
opacity functions. Such algorithms either already 
exist in the underlying visualization libraries (but 
they are simply not enabled) or they can be de- 
veloped readily. This work will also dictate the 
direction of development for customizing future 
VisIVOWeb portals employing VisIVOWeb Con- 
nect. At the time of writing, this technology is 
used for connecting our system seamlessly with 
the Italia n Theoretical Virt ual Observatory web 
archive (jCosta et al. 20081 ) and the UK mirror 
of the Sloan Digi tal Sky Server (data release 7) 
CasJobs service (jCaniglia et al. 20091 ). 
There is a number of user guides for VisIVOWeb 
available to dow nload from th e nodes operat- 
ing currentlv (llNAF-Catanial : llNAF-Trie"st3 : 
Univ. of Portsmouthh . Each node also contains 
streaming videos of on-line tutorials, the relevant 
slides are available for download at the VisIVO 
portao. The underlying visualization technol- 
ogy documents are accessible at the svn reposi- 
tory. The functionality of our system has been 
presented in the VO-Dajo events run recently 
accross Italy, organized in the framework of Euro 
VO-AId43 and IVOA. 



functionality so that users can upload and man- 
age their datasets, e.g. by using interactive wid- 
gets to construct customised renderings or gener- 
ating meaningful astrophysical animations or even 
storing their results (movies or otherwise) for fu- 
ture reference. To the best of our knowledge, 
there are no other comparable open-source envi- 
ronments available to astrophysicists in the Vir- 
tual Observatory. 

We reviewed relevant work and introduced the way 
users can gain access to the system for import- 
ing their datasets. We discussed the underlying 
model for data management, focusing on the de- 
veloped database objects, the relevant interface 
and a number of currently available operations. 
We also presented the visualization algorithms 
supported by the system, founded on the Visu- 
alization Toolkit and Splotch, together with the 
relevant visualization interfaces including an in- 
teractive editor for constructing customized opac- 
ity functions. The operational scenario for gen- 
erating scientific movies from large-scale datasets 
was then introduced discussing a specific example 
for demonstration based on a large-scale simula- 
tion containing 512 million particles. The whole 
data handled by VisIVOWeb for this movie cre- 
ation was in excess of 18 TBs. We then in- 
troduced VisIVOWeb Connect/Network for net- 
working VisIVOWeb portals and connecting them 
seamlessly to existing astrophysical archives. Fi- 
nally wc concluded with a description of the cur- 
rent status and pointers to a number of future de- 
velopments. 



11. Summary 

Modern visualization can aid astrophysicists in 
gaining good insights of complex datasets, gen- 
erated either from real-world observations or nu- 
merical simulations, by providing suitable tools 
for rapid and intuitive visual discovery. We have 
introduced an open-source WWW environment 
called VisIVOWeb for visualization of large-scale 
astrophysical datasets offering astrophysicists an 
array of advanced visualization tools that can 
be exploited effectively using common Internet 
browsers. The system offers full data management 



•^ ^http : //vlsivo . pact ■ Inaf ■ It/ 1 
•^^http : //wMwas . oats . inaf ■ it/voday/] 
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